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Algor ithms  that  manipulate  sata  of  ractanglaa  are  of  grant 
practical  importance  m  VL3I  daalgn  ayataau  and  other  applioa- 
tiona.  Although  much  theoretical  work  haa  appeared  recently  on 
the  complexity  of  rectangle  problems,  it  haa  aaauaiad  that  the 
inputs  are  given  aa  a  Hat  of  ractanglaa.  In  thin  paper  we  study 
the  complexity  of  rectangle  problems  when  the  inputs  are  giver, 
m  e  hierarchical  language  that  allows  the  designer  to  build 
large  designs  by  replicating  mull  designs,  we  will  see  that 
while  most  of  the  problaaia  are  TO-hard  in  the  general  oaae.  tnere 
are  0  N  Ig  M)  algorithms  that  process  inputs  obeying  certain 
raatncticna. 
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i.  I  NT  ROOUC  T J  ON 

Algor  lthms  that  manipulate  sat  a  of  ractanglaa  In  tha 
plana  ara  of  graat  intarast  to  practitioners  and  theoreti¬ 
cians  alike.  Practical  applications  of  such  algorithms  arise 
in  such  areas  as  computer  graphics,  architectural  design 
systems  and  VLSI  (Very  Large  Scale  Integrated  Circuitry) 
design  systems.  These  problems  also  have  great  appeal  from  a 
purely  mathematical  viewpoint)  the  problems  are  quite  natural 
and  easy  to  pose,  yet  the  solutions  often  have  a  rather  sub¬ 
tle  structure. 

Much  theoretical  work  has  been  done  recently  on  rectan¬ 
gle  problems*  we  will  return  to  a  brief  survey  of  that  work 
in  Subsection  3.1.  Almost  all  of  the  work,  though,  has  con¬ 
centrated  on  rectangle  sets  chat  are  defined  by  merely  listing 
all  the  rectangles  they  contain.  Although  this  is  quite  appro¬ 
priate  for  many  applications,  for  many  others  this  model  is 
terribly  inaccurate.  VLSI  circuits,  for  example,  are  usually 
not  specified  by  simply  listing  their  components,  but  rather 
are  described  in  a  hierarchical  design  language  that  allows 
a  designer  to  build  big  designs  in  an  orderly  way  from  msall 
designs. 

One  way  to  process  the  hierarchical  designs  is  merely 
to  throw  away  their  structure  and  treat  them  as  though  they 
were  given  as  sets  of  rectangles.  This  can  be  quite  costly, 
however,  because  such  designs  ocn  describe  an  exponentially 
large  number  of  rectangles. 


With  this  motivation  several  researchers  have  recently 
begun  to  investigate  the  problem  of  dealing  directly  with 
a  hierarchical  description  of  a  set  of  geometric  objects i 
see,  for  exaaple,  Hon  [1980]  and  Whitney  [I960].  The  approach 
that  they  have  taken,  however,  is  a  solution-oriented  stra¬ 
tegy  that  has  not  led  them  to  Investigate  many  of  the  theo¬ 
retical  questions  that  arise  in  this  endeavor. 

The  purpose  of  this  paper  is  to  provide  a  sound  theo¬ 
retical  basis  for  the  important  problem  of  manipulating 
hierarchical  descriptions  of  geometric  objects.  In  Section  2 
we  will  define  the  rectangle  problems  we  will  study  and  then 
consider  the  various  forms  in  which  their  input  might  be 
given.  In  Section  3  we  investigate  the  complexities  of  the 
[seven)  problams  when  their  inputs  are  given  in  (three; 
different  representations.  Finally,  implications  of  the  re¬ 
sults  and  directions  for  further  research  are  studied  in 


Section  4. 
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2.  PROBLEMS 

In  order  to  state  a  geometric  problem  precisely  and 
to  measure  its  complexity,  we  must  specify  the  language 
which  is  used  to  describe  an  instance  of  the  problem.  In 
this  section  we  will  first  give  a  list  of  rectangle  problems  and 
their  applications  using  geometric  terms  in  their  intu¬ 
itive  meaning.  We  will  then  introduce  a  hierarchical  lan¬ 
guage  which  allows  us  to  describe  the  input  and  output  of 
each  problsm  and  to  measure  its  size. 

The  first  problsms  in  our  list  are  intersection  problsms: 
we  are  given  a  set  of  rectangles  in  the  plane  (with  sides 
parallel  to  the  coordinate  axes)  and  we  ask  for  intersections 
among  these  rectangles  or  with  other  given  objects. 

Two  rectangles  are  said  to  intersect  if  the  interior 
of  their  intersection  contains  at  least  one  point.  Thus, 
the  Intersection  includes  both  proper  edge  intersection  and 
the  inclusion  of  one  rectangle  within  another,  but  not  two 
rectangles  that  touch  only  at  the  border. 

- .  ?a?or?  I'ittPBeatinp  Pairs 

Input:  A  set  of  rectangles. 

CXitput:  A  list  of  all  intersecting  pairs  of  rectangles. 

Application:  This  is  an  important  task  in  the  'Vecmetry 
engine”  underlying  most  VLSI  design  rule 
checkers:  see  Haken  [1980]  or  Hon  [1980]. 


Input:  A  set  of  rectangles. 

CXitput:  Yes,  if  there  is  at  least  one  pair  of  intersecting 
rectangles  in  the  set;  otherwise  no. 

Motivation:  This  is  a  more  math«natically  tractable  version 
of  problem  1 . 

c.  Inzer seazicn  -jizh  2  liters  Zb,jeoz 

Input:  A  set  of  rectangles,  and 

a)  a  point,  or 

b)  a  line  (parallel  to  x-  or  y-axis) ,  or 

c)  a  rectangle 

Output:  Yes,  if 

a)  the  query  point  lies  in  the  interior  of  at  least 
one  rectangle  in  the  set; 

b)  the  line  crosses  the  interior  of  at  least  one 
rectangle  in  the  set; 

c)  the  query  rectangle  intersects  at  least  one  rec¬ 
tangle  in  the  set;  otherwise  no. 

Applications:  a)  This  allows  an  interactive  user  to  point 
to  a  rectangle  and  have  the  system  retrieve 
it  for  him. 

b)  This  arises  in  routing  (see  Lauther  [1980]). 

c)  This  tests  the  validity  of  placing  a  rec¬ 
tangle. 


4.  Northernmost  Rectangle  Below  a  Line 

Input:  A  set  of  rectangles  and  a  horizontal  line  y  »  y  . 

*  o 

Note  that  an  important  special  case  occurs  when  y 

o 

is  greater  than  all  y-values  in  the  set. 

Output:  A  northernmost  rectangle  in  the  set  below  the  given 
line. 

Application:  This  is  related  to  the  routing  problem  of 
Lauther  Cl  980]. 

5 .  Measure  and  Perimeter  Problems 

Input:  A  set  of  rectangles. 

Output:  The 

a)  measure  or 

b)  perimeter  of  their  union;  i.e.  the  total  area 
covered  by  at  least  one  rectangle  in  the  set. 

Motivation:  These  problems  were  raised  in  a  theoretical 
context  by  Klee  [1977]. 

S .  Conv.eatedne33  Problems 

Input:  A  set  of  rectangles. 

Output:  a)  Yes,  if  their  union  is  a  connected  set,  and,  no, 

otherwise 

b)  the  number  of  connected  components  in  the  union 

c)  a  list  of  all  connected  components  in  the  union. 
Applications:  Connectedness  is  a  central  notion  in  VLSI 


circuit  extraction. 
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Zcualizu  and  Subset  Testing  of  Rectangle  Sets 

Input:  Two  descriptions  and  D2  of  sets  of  rectangles. 

Output:  Yes ,  if 

a)  the  set  denoted  by  is  equal  to  the  set  denoted 
by  D2 

b)  the  set  denoted  by  contains  the  set  denoted 
by  D2 

otherwise  no. 

Observe  that  these  problems  are  meaningful  only  if  a  language 
to  describe  sets  of  rectangles  has  been  fixed. 

We  will  now  specify  a  very  simple  hierarchical  language  for 
describing  sets  of  rectangles.  This  Hierarchic  Input  Language 
(HIL)  may  be  considered  as  a  proper  subset  of  the  Symbolic 
Layout  Language  defined  in  Mead  and  Conway  [1980,  Section  4.3]etx: 
of  CIF  (see  Section  4.5  of  the  above). 

The  HIL  language  describes  sets  of  rectangles  as  collec¬ 
tions  of  boxes.  Box  commands  describe  each  of  these  boxes  by 
specifying  the  x,y-  coordinates  of  the  lower-left  corner 
and  its  length  and  height.  We  may  assume  that  all  coordinates 
and  lengths  are  nonnegative  integer  multiples  of  the  layout 
unit  X.  For  instance,  the  command 

Box  (0,0)  (1,1) 

describes  the  unit  square  with  lower-left  corner  at  the  origin. 
The  general  form  of  a  Box  command  is 

Box  (X  Coord, Y  Coord) (Width, Height) 
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In  HIL  we  can  also  define  symbols  that  denote  sets  of  rec¬ 
tangles.  A  symbol  definition  is  a  symbol  number,  followed 
by  zero  or  more  attributes  and  a  list  of  Box  and  Draw 
commands.  A  Draw  command  has  the  form 

Draw  <  symbol  number  >  at  <  point  > 

<symbol  number>  must  be  the  number  of  a  previously  defined 
symbol,  and  <point>  is  a  pair  of  (x,y)  coordinates.  This 
command  describes  the  placement  of  the  symbol  denoted  by 
the  symbol  number  at  the  given  point.  To  make  this  more 
precise  we  consider  every  defined  symbol  as  "hooked"  to  the 
origin.  Thus,  drawing  a  symbol  at  a  point  means  to  place  its 
origin  on  that  point. 

We  give  an  example  of  a  symbol  definition  with  empty 
lists  of  attributes: 

1  :  Box  (0,0) (3,1 ) 

2  :  Box  (0,0) (1 ,3) 

3  :  Box  (4,3)  (1  ,5) 

Draw  1  at  (4,4) 

Draw  1  at  (4,6) 

Draw  2  at  (6,4) 


Symbol  3  is  as  follows: 


We  consider  it  as  hooked  to  the  origin.  The  set  of  rec¬ 
tangles  denoted  by  symbol  number  3  consists  of  two  copies 
of  box  1 ,  one  copy  of  box  2  and  one  copy  of  the  box  in 
symbol  3  placed  in  the  plane  as  shown  by  the  above  figure. 

We  now  introduce  the  notion  of  a  bounding  rectangle,  or 
BR,  of  a  symbol:  A  bounding  rectangle  of  a  symbol  is  any 
rectangle  that  includes  all  boxes  in  the  set  of  rectangles 
denoted  by  the  symbol.  A  BR  is  specified  by  the  coordinates 
of  its  lower-left  corner  and  its  width  and  height. 

Thus,  the  rectangle  with  lower-left  corner  at  (3,3), 
length  4  and  height  5  is  a  BR  of  the  above  defined  symbol  3 
(note  that  it  is  not  the  minimal  BR) . 

HIL  allows  us  to  augment  a  symbol  definition  with  an 
attribute  which  specifies  a  BR  (not  necessarily  the  minimum 
BR)  of  the  symbol.  Thus,  a  symbol  definition  has  the  general 
form: 

<symbol  number> :  {BR<point> (<length> ,<height>) }  ^ 

* 

{<box  ccmmand>}0 

* 

{<draw  ccmmand>}0 

We  will  assume  w.l.o.g.  that  the  set  of  rectangles  (i.e.,  boxes, ) 
defined  by  a  HIL  file  is  the  symbol  denoted  by  the  largest 
symbol  number,  which  we  will  often  call  the  root  symbol. 

Symbol  calls,  i.e.  symbol  numbers  occurring  in  draw  commands 
of  a  symbol  definition, may  nest.  That  is,  the  definitions  of 
the  called  symbols  may  contain  calls  of  other,  previously 
defined  symbols  etc.  Thus,  we  can  associate  to  every  symbol 
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definition  a  call  graph  which  reflects  the  hierarchical 
structure  of  the  symbol  definition.  The  above  definition 
of  the  symbol  3  has  the  following  call  graph: 

th 

1  2 

Note  that  the  call  graph  is  always  a  directed  acyclic  graph, 
which  we  abbreviate  as  a  DAG. 

We  define  the  length  of  the  HIL  description  of  a  symbol 
as  the  total  number  of  Box -  and  Draw  commands  occurring  in 
the  definition  of  the  symbol.  Thus,  the  length  of  the  above 
HIL  definition  is  6.  Observe  that  we  do  not  count  the  length 
of  the  numbers  (symbol  numbers,  coordinates,  etcj  occurring 
in  the  definition.  (We  will  return  to  this  point  in  Sub¬ 
section  3.2). 

One  easily  observes  that  the  same  set  of  rectangles 
may  have  very  different  HIL  descriptions,  whose  lengths  may 
differ  by  an  exponential  factor.  We  may  give  a  description 
of  a  set  of  N  rectangles  in  the  plane  not  using  the  hierar¬ 
chical  structure  of  HIL  at  all  by  'writing  N  2ox  commands  in 
the  definition,  one  for  each  rectangle.  This  description  of 
a  set  of  rectangles  is  obviously  equivalent  to  the  usual 
assumption  made  in  much  previous  work  on  geometric  problems, 
namely,  that  the  set  of  rectangles  is  given  by  the  set  of 
coordinates  of  their  corners.  We  will  call  this  a  purely' 
geometrical  description  of  a  set  of  rectangles. 


When  an  KTT  description  of  a  set  of  rectangles  uses  BR  attri- 

_ tes  they  should  often  be  consistently  assigned.  That 

means  whenever  a  user  specifies  a  BR  attribute  in  a  symbol 
definition  it  should  always  contain  the  minimum  bounding 
rectangle  which  includes  all  rectangles  (boxes)  denoted  by 
the  symool.  Clearly,  the  minimum  bounding  rectangle  for  a 
set  of  rectangles  denoted  by  a  symbol  in  HIL  can  always  be 
computed  by  following  the  hierarchic  definition  "bottcm-up" . 

By  imposing  stronger  constraints  on  the  symbols  which 
we  might  use  to  define  new  ones  we  can  restrict  the  ex¬ 
pressive  power  of  HIL  considerably.  We  may  use  the  BR  attri¬ 
butes  (or  other  attributes)  to  state  and  check  these  con¬ 
straints.  As  an  example  we  will  single  out  a  restricted 
version  of  HIL  which  allows  us  only  to  define  sets  of  rectan¬ 
gles  which  do  not  contain  any  pair  of  intersecting  rectangles: 

Let  us  assume  that  every  symbol  definition  is  augmented  with 
a  consistent  BR  attribute.  Then  we  require  that  all  boxes 
and  all  Bl;'s  of  the  symbols  occurring  in  Draw  commands  of 
a  symbol  definition  do  not  intersect.  Thus,  under  the  assump¬ 
tion  that  all  called  symbols  denote  sets  of  nonintersecting 
rectangles  the  same  holds  true  for  the  defined  symbol.  Let 
us  call  the  thus  restricted  version  of  HIL  consistent. 
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3.  COMPLEXITY  RESULTS 

In  this  section  we  shall  study  the  complexity  of  the 
various  rectangle  problems  under  the  various  formats  for 
input.  The  primary  results  for  this  section  are  summarized 
in  Table  1.  In  Subsection  3.1  we  will  review  the  results 
shown  in  the  first  column  of  Table  1 ;  that  section  is  just 
a  survey  of  previous  work  T.n  Subsection  3.2  we  will  study 
the  results  of  the  third  column  (in  which  the  problems 
have  unconstrained  HIL  input)/  and  in  Subsection  3.3  we 
will  study  the  results  of  the  second  column  (in  which  the 
problems  have  consistent  HIL  input) . 


i 


.Input 


Problen  V 

|  Geometry 

Only 

Consistent 

HIL 

General 

HIL 

Report  Intersecting  Pairs 

N  lg  N  k 

1 

Exponential 

Intersection  Question 

N  lg  N 

1 

tg-oanplete 

Intersection  with 

Query  Object 

a.  Point 

N 

N 

NP-canplete 

b.  Line 

N 

NP -complete 

NP-aanplete 

c.  Rectangle 

N 

NP-conplete 

NP-oanpiete 

Northernnoet  Rectangle 
Below  a  Line 

N 

NP-oanpiete 

NP-canplete 

Measure  Problen 

N  lg  N 

N 

NP-canplete 

Perimeter  Problen 

N  lg  N 

N 

NP-aanplete 

Connectedness  Problens 

a.  Single  Caponent 

(N+S)  lg  N 

1 

NP-hard 

b.  Number  of  Components 

(N+S)  lg  N 

N 

NP-hard 

c.  Report  all  Carpcnenta 

N  lg  N  +  S-G (S) 

Exponential 

Exponential 

Equality  and  Subset 

N  lg  N 

? 

NP-hard 

Testing  of  Rectangle  Sets 

N  lg  N 

NP-hard 

Table  1.  Complexities  of  Rectangle  Problems. 


In  this  subsection  we  shall  review  the  complexity  of  rec¬ 
tangle  problems  when  their  inputs  are  given  in  geometric 
forms  (or,  equivalently,  given  as  a  list  of  Bax-ctmmands 
in  HIL) .  Rectangle  problems  having  this  input  format  have 
recently  been  extensively  studied  in  the  literature.  See, 
for  instance,  Bentley  and  Wood  [1980],  van  Leeuwen  and 
Wood  [1979],  Vitanyi  and  Wood  [1979],  Nievergelt  and  Prepa- 
rata  [1980],  and  McCreight  [1980].'  We  shall  review  the 
lenown  results  and  sketch  the  basic  techniques  which  have 
been  used  to  obtain  the  results. 

We  will  now  consider  the  first  problem  in  our  list  m 
seme  detail: 

A  naive  algorithm  checks  all  (^;  pairs  of  rectangles  m 

a  given  set  of  N  rectangles  and  thus  solves  the  problsm 

in  quadratic  time.  This  is  optimal  in  the  worst  case 

because  all  N  rectangles  could  intersect,  yielding  .n 

output  of  size  (%} •  Ona  can  do  battar,  howavar,  by 
* 

first  sorting  the  2N  values  of  their  lower  and  upper 
boundaries  and  then  moving  a  scan  line  through  the  set 
bottam-to-top,  keeping  track  of  rectangles  intersecting 
the  current  scan  line.  More  precisely:  Let  us  assume 
that  every  rectangle  R  is  specified  by  the  4  values 
;x  iR>  ,x^  '  R)  ,y  ,  fR)  ,y  .  (R)>  of  their  left,  right,  lower  bettor 
and  upper' too)  boundaries .  v;e  sweep  a  horizontal  3can  line 


SL  through  in#  set  of  rectangles.  At  each  instant  of 
time  the  scan  line  divides  the  set  of  rectangles  into 
three  disjoint  sets:  The  set  of  dead  rectangles  which 
have  been  cut  by  SL,  the  set  of  active  rectangles 
which  are  currently  cut  by  the  scan  line  SL,  and  the 
set  of  sleeping  rectangles  which  will  be  cut  by  SL. 

These  sets  change  only  if  SL  passes  a  lower  or  upper 
ooundary  of  some  rectangle. 

Whenever  a  sleeping  rectangle  becosies  active,  i.e. 
whenever  SL  halts  at  y^  ( R ) ,  we  check  all  currently 
active  rectangles  for  intersection  with  R.  This  stra¬ 
tegy  assures  that  we  do  not  miss  any  pair  of  inter¬ 
secting  rectangles  after  sweeping  SL  once  over  the  whole 
set  of  rectangles.  Consider  an  instant  of  time,  where 
a  sleeping  rectangle  R  becomes  active,  i.e.  when  SL 
halts  at  yg  (R) .  How  can  we  detect  intersection  with  all 
currently  active  rectangles?  Let  us  assusie  that  R  is 
such  a  rectangle.  Then  R  and  R  intersect  iff  their 
projections  to  the  x-axis  overlap,  i.e.  iff  [x  (R),x#(R)J 
and  [x  (R) , x  ( R) )  have  a  nonempty  intersection.  (Thus 
the  scanning  technique  has  reduced  the  intersection 
problem  from  a  two-dimensional  to  a  one -dimensional 
problem. 

Our  above  considerations  show  that  it  is  sufficient 
to  store  the  x-projections  of  the  currently  active  rectangles 
in  a  data  structure  L  such  that  we  are  able  to  answer  the 
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above  question  efficiently.  Furthermore,  L  must  be  dynamically 
altered  during  the  scan-line  sweep:  L  is  initially  empty; 
whenever  a  sleeping  rectangle  R  becomes  active,  its  projec¬ 
tion  to  the  x-axis  is  inserted  into  L,  and  whenever  R  becomes 
dead,  its  projection  to  the  x-axis  is  deleted  from  L. 

The  splitting  of  the  one-dimensional  overlapping  segment 
problem  into  a  range  and  inverse  range  query  suggests  to  choose 
a  pair  of  a  range  and  a  segment  tree  for  L.  Bentley  and  Wood 
[1980]  used  this  to  solve  Problem  1  in  time  0 (N  log  N  +  k) , 
where  k  is  the  number  of  intersecting  pairs,  and  in  space 
0 (N  log  N) . 

McCreight  [1980]  uses  tile  trees  to  improve  that  approach 
and  obtain  a  solution  with  time  complexity  0 (N  log  N  +  k) 
and  space  complexity  0(N)  which  is  optimal.  Bentley,  Haken 
and  Hon  [1980]  use  an  array  of  segment  bins  for  L  which 
yields  a  linear  expected  time  solution  to  the  problem  for 
sets  of  rectangles  occurring  in  real  chip  designs. 

we  will  now  briefly  examine  the  remaining  rectangle  pro¬ 
blems  when  their  input  is  given  in  geometric  form. 

2 .  Intersection  Question 

This  problem  can  be  solved  by  the  same  algorithm  which 
was  used  to  solve  Problem  1 :  Just  stop  it  after  the  first 
intersecting  pair  of  rectangles  was  found,  if  there  is  one, 
or,  if  the  scan  line  has  passed  the  whole  set  of  rectangles. 
Thus,  the  answer  to  question  2  can  be  computed  in  time 
0  (N  log  N)  . 
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However,  in  order  to  detect  intersection  it  is  suffi¬ 
cient  to  keep  the  list  of  left  and  right  boundaries  of  the 
currently  active  rectangles  sorted  according  to  their  x-values 
in  a  simple  AVL  tree  L  during  the  scan  line  sweep  bo t tom- to - 
top.  Whenever  a  rectangle  becomes  active  (respectively  dead) 
its  left  and  right  boundaries  are  inserted  into  L  (respectively 
deleted  from  L) .  There  is  at  least  one  pair  of  intersecting 
rectangles  in  the  set  if  and  only  if  for  at  least  one  rec¬ 
tangle  R  either  the  newly  inserted  left  or  right  boundary  of 
R  is  squeezed  in  between  the  boundaries  of  any  other  active 
rectangle  or  the  boundaries  of  R  are  separated  by  boundaries 
of  any  other  active  rectangle,  i.e.  the  boundaries  of  R  do 
not  become  immediate  successors  in  L.  This  observation  leads 
to  a  simpler  0(N  log  N)  time  algorithm  for  solving  Problem  2. 

Z .  Zmarseozion  wizh  a  Query  Object 

It  is  obvious  that  we  can  detect  intersection  of  a  query 
object  (a  point,  line,  or  rectangle)  with  at  least  one  rec¬ 
tangle  in  a  set  of  N  rectangles  in  linear  time.  We  can  just 
use  the  naive  approach  of  sequentially  comparing  each  rec¬ 
tangle  to  the  object  and  need  no  preprocessing.  If  prepro¬ 
cessing  is  allowed,  it  is  possible  to  maintain  the  set  of 
rectangles  under  sequences  of  insertions  and  deletions  of 
rectangles;  Vaishnavi  and  Wood  [1980]  claim  to  have  a  solution 
to  the  dynamic  version  of  this  problem  which  takes  prepro¬ 
cessing  time  0 (N  log  N)  and  query  time  0(log  N) . 
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4 .  .'iortharnmoat  Rectangle  Below  a  Line 

Both  th«  g*n*r*l  and  th*  ap*cial  casa  of  this  probl*m 
can  obviously  b*  solved  in  linear  tine. 

5 .  Measure  and  Perimeter  Pro'olema 

These  problems  can  be  solved  by  the  scan  line  technique 
which  was  used  to  solve  Problems  1  and  2  above.  When  moving 
the  scan  line  bottom-to-top  through  the  set  of  rectangles 
we  keep  track  of  appropriate  information  about  the  currently 
active  rectangles  like  the  1  -dimensional  measure  of  the  pro** 
jections  of  the  currently  active  rectangles  to  the  x-axis. 
Thus  th*  measure  and  the  perimeter  can  be  accumulated  in  time 
0 (N  log  N) .  See  Bentley  [1977],  van  Leeuwen  and  Wood  [1979], 
and  Vitanyi  and  Wood  [1979]  for  th*  details. 

•) .  Lonneazednaaa  Problems 

Hievergalt  [1981]  gives  a  solution  for  this  problem  which 
uses  the  scan  line  technique:  The  "dual  graph"  is  construc¬ 
ted  on-line  during  th*  sweep  of  the  scan  line  for  the  even 
more  general  case  where  the  given  objects  have  arbitrary 
polygons  as  their  boundaries.  Th*  "dual  graph"  reflects  the 
connectedness  structure  of  the  objects  in  the  set.  If  N  and 
S  are  the  total  numbers  of  edges  and  intersections,  respec¬ 
tively,  th*  algorithm  can  be  carried  out  in  time  0 ( (N+S)  logtJ)  . 
This  unplies  that  all  three  subproblems  of  Problem  6  can 
cer.air.ly  solved  within  the  same  time.  It  is  left  open 
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whether  or  not  an  Improvement  to  0(N  log  N)  for  the  problems 
6a)  and  b)  and  to  0(N  log  N  +  k)  for  problem  6c)  is  possible, 
where  k  denotes  the  number  of  connected  components. 

However,  it  is  easy  to  see  that  one  can  solve  the 
connectedness  problems  in  timeO(N  log  N  +  S  *  G(S)),  for  a 
set  of  N  rectangles  with  S  intersections,  where  G  denotes 
the  inverse  of  the  Ackermann  function.  For,  the  scan  line 
technique  reduces  the  problem  to  the  problem  of  determining 
pairwise  intersections  and  performing  at  most  S  Union  or 
Find  operations. 

7 ,  £' quality  and  Subset  Testing  of  Rectangle  Sets 
By  first  sorting  the  two  sets  of  coordinates  we  obviously 
can  get  a  solution  to  these  problems  in  time  0(N  log  N) ,  using 
the  methods  used  in  Problems  1  and  5 . 
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3 . 2  General  HIL  Input 

In  this  subsection  we  shall  study  the  complexity  of 
rectangle  problems  when  their  inputs  are  given  in  the 
Hierarchical  Input  Language  HIL.  Most  of  the  results  that 
we  will  see  in  this  subsection  are  negative;  that  is,  we 
will  see  that  most  of  the  problems  either  provably  require 
exponential  time  (because  an  extremely  concise  HIL  descrip¬ 
tion  can  generate  exponentially  large  output)  or  are  XP- 
complete  (because  HIL  can  generate  rectangle  sets  in  which 
NP-hard  problems  can  be  encoded) .  Because  all  of  our  proofs 
use  only  two  primary  constructions  (giving  an  exponentially 
large  output  and  encoding  an  NP-hard  problem) ,  we  will  first 
examine  those  two  constructs  in  detail  in  Subsection  3.2.1  , 
and  then  turn  to  the  rest  of  the  problems  in  Subsection  3.2.2. 
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3.2.1  Pairwise  Intersection  Problems 

In  this  section  we  shell  study  two  problems  that  con¬ 
cern  the  pairwise  intersections  among  a  set  of  rectangles 
specified  by  an  HIL  description  of  length  N  (recall  that 
the  length  of  an  HIL  description  is  defined  to  be  the  total 
number  of  Bax  and  Draw  commands  contained  in  the  input) . 

The  two  problems  are  to  report  all  intersecting  pairs  of 
rectangles,  and  to  determine  whether  any  two  pairs  intersect. 
We  shall  first  examine  the  problem  of  reporting  all  inter¬ 
secting  pairs,  which  the  following  theorem  shows  is  diffi¬ 
cult  in  the  worst  case. 

Theorem  1 ; 

The  problem  of  reporting  all  intersecting  pairs  of 
rectangles  defined  by  ar.  HILdescription  of  length  N 
must  sometimes  require  time  exponential  in  N. 

Proof : 

We  will  construct  a  particular  HIL  file  of  length  N 

that  contains  2  '  overlapping  unit  squares  with  lower- 

left  corners  at  the  origin;  because  the  output  must 
N/2 

include  all  (2 2  )  pairs,  it  is  of  size  (2N/2) * (2N/2-1 )/2, 

JJ_1 

or  approximately  2  .  Symbol  1  in  this  file  is  defined 


as 


1  ;  BR  (0,0)  ,(1,1) 
Box  (0,0)  ,(1,1) 
Box  (0,0)  ,(1,11 
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so  it  consists  of  two  unit  squares  with  lower-left 

corners  at  the  origin.  The  ith  symbol,  for  2<,i^N,  is  defined 

as 

i:  BR  (0,0)  ,(1,1) 

Draw  i-1  at  (0,0) 

Draw  i-1  at  (0,0) . 

It  easy  to  prove  by  induction  that  the  ith  symbol  contains 

exactly  21  overlapping  rectangles,  sc  the  N^ectangle  contains 
N 

2  ,  and  the  construction  is  complete.  QED. 

We  will  now  turn  our  attention  to  the  more  subtle  pro¬ 
blem  of  testing  whether  any  two  elements  intersect  in  a  set 
of  rectangles  given  by  an  HIL  of  length  N.  Our  primary  result 
for  this  problem  is  that  it  is  HP-complete?  our  first  step 
toward  showing  this  is  the  following  lemma. 

Lemma  2: 

The  HIL  intersection  question  is  in  HP. 

Proof : 

The  nondeterministic  algorithm  first  guesses  two  rec¬ 
tangles,  then  verifies  that  they  intersect.  A  rectangle 
is  guessed  by  starting  at  the  root  symbol  and  nondeter- 
ministically  following  down  the  call  structure  of  the 
HIL  description  until  a  Box  command  is  reached.  CED. 

Mote  that  the  above  proof  is  concise  precisely  because  a 
certificate  of  intersection  for  a  particular  HIL  is  so  simple: 
we  merely  display  the  two  intersecting  rectangles. 
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The  next  part  of  the  proof  is  the  more  substantial:  we  will 
demonstrate  that  the  HIL  intersection  question  is  in  fact 
NP-hard.  The  reduction  is  to  the  knapsack  problem,  which 
asks  whether  there  is  seme  subset  of  a  set  of  integers  whose 
sum  is  a  given  integer  (see  Garey  and  Johnson  [1979]). 

Lemma  3 : 

The  HIL  intersection  question  is  NP-hard. 

Proof : 

We  will  show  that  the  question  of  whether  some  subset 
of  the  set  of  positive  integers  W  *  {  w1  ,w2, . . .  ,wN  }  suns  to  the 
given  integer  T  can  be  reduced  in  polynomial  time  to 
an  HIL  intersection  problem.  Our  first  step  is  to  define 
in  N+1  HIL  symbols  a  set  of  2N  x-by-x  rectangles  (for 
any  0<x<1  ,  say  x»1  /2)m  whose  left  hand  sides  are  aligned 
along  the  integers  frem  0  to  2N-1  and  whose  bottom  sides 
have  heights  corresponding  to  the  sums  of  all  21*  subsets 
of  weights.  Symbol  0  is  defined  as 

0:  BR  (0,0)  ,  (x,x) 

Box  (0,0) , (x,x) 


*  Note  that  we  have  taken  a  liberty  with  the  definition  of  HIL  by 
using  a  rectangle  of  the  neninteger  size  (1/2, 1/2) .  This  could  easily 
be  fixed,  but  that  would  only  obscure  the  structure  of  the  proof. 
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For  liiiN,  symbol  i  is  defined  as 

i:  BR  (0,0)  ,  (2X-1  +x,  x  +  I  w  ■) 

11]  ii  J 

Draw  i-1  at  (0,0) 

Draw  i-1  at  (2i_1,wi). 

Note  that  the  heights  of  the  bottoms  of  the  rectangles 
in  symbol  i  represent  the  sums  of  all  the  subsets  of 
<;w  ,w,  ,  w  } ;  this  is  easily  proved  by  induction. 

Now  that  we  have  represented  the  sums  of  all  the  subsets 
by  a  sequence  of  rectangles  at  various  neigths,  we  must  do 
the  same  for  the  desired  sum  T.  Symbol  N+1  is  defined  as 

N+1  :  BR  (0 ,  T)  ,  ( 1  +x  ,x  ) 

Draw  0  at  (0,T) 

Draw  0  at  (1  ,T)  ; 

it  places  two  x-by-x  rectangles  at  height  T.  We  then  copy 
those  rectangles  by  defining,  for  2<i<N, 

N+i:  BR  (0 ,  T)  ,  (21-1+x,x) 

Draw  N+i-1  at  (0,0) 

Draw  N+i -1  at  (2i_1 ,0) . 

N 

Note  that  the  symbol  N+N  =  2N  consists  of  a  row  of  2 

rectangles  with  bottoms  at  height  T  and  left  sides  along 

N 

the  integers  frcm  0  to  2*  -1  . 

The  stage  is  now  completely  set;  the  final  symbol  is 
defined  as 

2N+1 :  BR(C,0) (2N-1 +x ,  x+  I  w.) 

liiiN  1 

Draw  N  at  (0,0) 


Draw  2N  at  (0,0) 
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and  there  is  a  solution  to  the  knapsack  problem  if 
and  only  if  sane  pair  of  rectangles  in  symbol  2N+1 
intersect  (for  by  the  distinctness  of  x-values  in 
symbols  N  and  2N,  two  rectangles  intersect  if  and  only 
if  they  share  the  same  y-value  of  T) .  QED. 

An  example  of  the  construction  used  in  this  proof  is  given 
as  Example  1 .  The  two  above  lemmas  can  now  be  combined  to 
prove  Theorem  4. 

Theorem  4 : 

The  HIL  intersection  question  is  NP-complete. 

Proof : 

Immediate  from  Lemmas  2  and  3. 
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Example  1 

In  this  example  we  will  see  how  a  particular  instance 
of  the  knapsack  problem  can  be  reduced  to  a  problem  of  tes¬ 
ting  for  intersection  in  a  rectangle  set  defined  in  HIL.  We 
will  assume  that  the  set  W  is  {2,3,5}  and  we  want  to  know 
whether  any  subset  sums  to  5.  Symbols  1,2,3  are  illustrated 
below. 
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Symbol  1  Symbol  2  Symbol  3 

Symbol  6  has  the  structure 


4  1 


0  - I - 1 - !—! - 1 - 1 - 1 - 1 - 

12  3  4 

When  symbols  3  and  6  are  overlayed  they  have  two  intersections, 
at  (3,5)  and  (4,5);  these  correspond  to  the  subsets  {2,3}  and 
(5;,  both  of  which  sum  to  5.  End  of  Exanvole. 
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Because  we  will  make  extensive  use  of  the  construction 
used  in  the  proof  of  Lemma  3  ,  it  is  important  that  we  ana¬ 
lyze  the  construction  in  scsne  detail.  The  first  aspect  to 
note  is  that  because  we  reduced  the  HIL  intersection  ques¬ 
tion  to  the  knapsack  problem,  which  is  known  to  be  solvable 
in  pseudo-polynominal  time,  we  have  shown  only  the  weak 

NP-campleteness  (and  not  the  strong  NP-ccmpleteness)  of 

* 

the  HIL  intersection  question  .  This  is  an  essential  aspect 
of  the  proof,  because  the  HIL  intersection  question  is  it¬ 
self  solvable  in  pseudo-polynomial  time  (that  is,  it  is 
solvable  in  polynomial  time  if  the  inputs  are  expressed  in 
unary) .  To  prove  this  it  suffices  to  observe  that  both  the 
height  and  width  of  the  minimum  bounding  rectangle  of  the 
root  symbol  are  less  than  the  sum  of  all  the  input  para¬ 
meters;  thus  all  the  symbols  in  the  set  must  be  placed  on 
an  integer  grid  of  size  at  most  the  square  of  the  input 
length.  We  can  record  for  each  cell  in  the  grid  which 
of  the  linear  number  of  symbols  have  already  been  instan¬ 
tiated  there,  and  thus  avoid  doing  more  than  a  polynomial 
amount  of  checking. 


*  For  a  discussion  of  the  notion  of  weak  NP-ccmpleteness, 
see  Garey  and  Johnson  [1979], 
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The  second  important  fact  to  note  is  that  the  HIL  descrip¬ 
tions  constructed  in  the  proof  of  L senna  3  are  in  a  very 
well-behaved  subset  of  HIL.  The  most  Important  propertv 
is  that  2N+1  of  the  2N+2  symbols  are  consistent  in  the  sense 
that  the  bounding  boxes  of  their  symbols  do  not  intersect; 
only  symbol  2N+1  has  intersecting  subsymbols.  Thus  the  HIL 
intersection  question  remains  NP-complete  even  when  we  con¬ 
strain  the  HIL  input  to  contain  at  most  one  pair  of  over¬ 
lapping  called  symbols.  There  are  a  number  of  other  senses 
in  which  the  graph  of  the  construction  is  well-behaved: 
it  uses  only  one  Box  command,  every  symbol  (except  0)  makes 
exactly  two  calls,  and  every  symbol  (  except  0)  is  called 
exactly  twice  by  exactly  one  other  symbol. 
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3.2.2  Otner  Proplprs  wnn  H I L  Input 

In  the  previous  subsection  we  saw  the  following  results 
for  rectangle  problems  with  inputs  m  HIL. 

3 .  her  or  t  Intersect  inn  Pairs 

Theorem  1  showed  by  a  counting  argument  that  this 
problem  must  require  exponential  time  in  the  worst  case. 

: .  Intersection  lueatior. 

Theorem  4  showed  that  this  problem  is  (weakly)  NP- 
ccoplete,  using  a  reduction  to  the  knapsack  problem. 

We  will  now  briefly  examine  the  remaining  rectangle  questions  and 
problems  when  their  inputs  are  given  in  HIL. 

o .  Intersection  with  a  Q'uerh  iblect 

We  will  consider  the  three  subproblems  in  decreasing 
order  of  generality. 

a)  Intersection  with  a  Rectangle 

To  prove  this  problem  NP-complete  we  will  use  the 

construction  in  the  proof  of  Lemma  3  and  replace 
N 

the  2  squares  at  height  T  with  one  rectangle  of 

N 

height  x  and  width  2  -1+x,  with  bottom  left  corner 
at  (0, T) .  The  knapsack  problem  has  a  solution  if 
and  only  if  that  rectangle  intersects  any  other. 

b )  Intersection  with  a  Line 

Replace  the  query  rectangle  in  the  above  construction 

by  a  horizontal  line  with  height  T+x/2. 

a.)  Intersection  with  a  Point 

Use  the  construction  of  parts  a  and  b,  but  place  all 
squares  to  have  their  left  edges  on  the  line  x«0. 

The  query  point  at  (x/2, T+x/2)  intersects  a  rectangle 
if  and  only  if  the  knapsack  problem  has  a  solution. 
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4 .  Vorthernmoat  F.eozangla  3«low  a  Lin a 

The  general  problem  can  be  proved  NP-ccmplete  by  the 
proof  of  Problma  3b)  if  the  northernmost  rectangle 
below  the  line  y»  T+i -€  is  at  y»T ,  then  and  only  then 
does  the  knapsack  problem  have  a  solution.  The  spe¬ 
cial  case  of  finding  the  northernmost  rectangle  in 
the  entire  set  is  solvable  in  linear  time  by  computing 
"bottam-up“  the  northernmost  rectangle  in  every  symbol 

5 .  Meaaupa  and  Parimstsv  Problsms 

To  prove  these  problems  NP-ccmplete  we  will  use  the 

construction  in  the  proof  of  3a,  in  which  the  sums 

of  all  subsets  are  represented  by  a  set  of  rectangles 

at  a  vertical  line.  We  first  construct  such  a  set  and 

compute  its  measure,  then  augment  the  old  set  by  a 

single  x-by-x  square  at  height  T;  the  new  measure 
2 

is  x  greater  than  the  old  if  and  only  if  there  is  no 
solution  to  the  knapsack  problem.  Likewise,  the  peri¬ 
meter  increases  by  4x. 

■?.  Connestadness  Problems 

a)  Ars  the  P.estancle 3  in  a  Sinaia  Conneczed  Psnrcne 
We  will  show  that  this  problem  is  NP-hard  by  using 
the  same  constructions  as  for  problems  3a  und  5.  We 
first  place  all  squares  with  their  bottom  sides  at 
the  sums  of  all  subsets  and  their  left  sides  along 
the  y-axis.  We  then  place  a  rectangle  of  width  1 


and  height  equal  to  x  plus  the  sum  of  all  weights 
in  W  to  have  its  right  side  along  the  y-axis.  Note 
that  at  this  time  all  of  the  rectangles  form  a 
single  connected  component.  Finally ,  we  place  a 
single  (x/2,x)  rectangle  at  (x/2,T)j  that  rectangle 
is  in  the  single  connected  component  if  and  only 
if  the  knapsack  problem  has  a  solution.  Note  that 
this  proof  shows  only  that  the  problem  is  NP-hard, 
and  not  that  it  is  in  NP. 
o)  Number  of  Connected  Components 

By  the  reduction  of  6a,  this  problem  is  also  NP-hard. 
a)  Enumeration  of  Alt  Connected  Components 
The  construction  used  in  the  proof  of  Theorem  1 
can  be  slightly  augmented  to  produce  in  N  symbols 
a  total  of  2N  disjoint  squares?  the  time  to  output 
them  alone  shows  that  this  problem  must  require 
exponential  time. 


7 .  Equality  and  Subset  Testing  of  Rectangle  Sets 

The  construction  used  in  question  6a  shows  that  these 
problems  are  NP-hard. 
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3 . 3  Consistent  HIL  Input 

In  this  subsection  we  shall  study  the  complexity  of 
rectangle  problems  when  their  inputs  are  given  as  consistent 
HIL  descriptions.  That  is,  for  every  symbol  in  the  HIL  de¬ 
scription,  the  bounding  boxes  of  all  objects  within  the 
symbol  must  be  contained  within  the  symbol's  bounding  box 
and  nonoverlapping.  This  restriction  has  very  different 
impacts  on  the  complexity  of  the  various  rectangle  problems: 
whereas  most  of  the  problems  with  general  HIL  input  were 
NP-hard,  we  will  see  that  for  consistent  designs  seme 
problems  remain  NP-hard,  while  other  problems  are  now  sol¬ 
vable  in  polynomial  time  (usually  linear  or  S(N  lg  N)),  and 
still  others  beccme  trivial  (that  is,  they  earn  be  solved 
in  constant  time) . 

The  first  problem  that  we  must  face  when  dealing  with 
consistent  designs  is  that  of  verifying  that  an  allegedly 
consistent  design  does  in  fact  satisfy  the  properties  of 
having  only  contained  and  nonoverlapping  subsymbols.  This 
property  is  easy  to  verify  using  the  scanning  algorithm  to 
solve  Problem  2  in  Subsection  3.1  (that  is,  given  N  rec¬ 
tangles  in  the  plane,  do  any  intersect?) ;  recall  that 
9 (N  lg  N)  time  is  necessary  and  sufficient  to  test  this 
property.  Our  algorithm  for  testing  consistency  will  now 
proceed  bottam-up  through  the  HIL  design,  symbol -by-symbo 1 , 
and  use  the  geometry-only  intersection  checker  to  verify 
that  no  subsymbols  in  any  symbol  intersect;  it  is  trivial 
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to  ensure  in  linear  time  that  all  subsymbols  are  in  fact 
contained  within  the  symbol's  bounding  rectangle.  By  the 
fact  that  there  are  a  total  of  0(N)  rectangles  and  calls 
on  subsymbols  altogether  in  the  HIL  and  the  fact  that  the 
function  0(N  lg  N)  is  concave  upward,  the  entire  algorithm 
takes  at  most  0(N  lg  N)  time.  Note  that  6(N  lg  N)  is  ne¬ 
cessary  for  the  case  of  a  one-level  consistent  design,  so 
this  bound  is  best  possible. 

We  turn  now  to  study  the  rectangle  problems  themselves. 

The  first  two  problems  are  trivial  for  designs  that  we 
know  to  be  consistent. 

1 .  Report  Intersecting  Pairs 

2 .  Intersection  Question 

Both  of  these  problems  can  be  answered  in  constant 
time  because  a  consistent  design  is  known  to  have  no 
intersecting  pairs  of  rectangles. 

The  remaining  problems  do  not  admit  trivial  solutions. 

3 .  Intersection  with  a  Quern  Object 

We  will  consider  the  three  subproblems  in  increasing 
order  of  generality. 

a )  Intersection  with  a  Point 

This  problem  can  be  solved  in  linear  time.  To  see 
if  any  of  the  symbols  intersects  a  given  point  we 
start  at  the  root  symbol  and  then  recursively  search 
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down  the  DAG  that  is  the  HIL  description;  at  each 
symbol  we  visit  at  most  one  of  its  subsymbols. 

(Note  that  if  we  have  to  visit  more,  then  the  sub- 
symbols  overlap,  which  violates  consistency) . 

b J  Interaeetion  with  a  Line 
2 )  Interaeetion  -jith  a  Rectangle 

Because  the  HIL  descriptions  used  in  Subsection  3.2 
to  prove  the  NP-hardness  of  these  questions  for 
general  designs  were  in  fact  consistent,  both  pro¬ 
blems  b  and  c  remain  NP-camplete  for  consistent 
designs. 


4 .  .Vcrthemmoat  Reetgngle  3elow  a  -ine 

Because  the  proof  in  Subsection  3.2  uses  a  consistent 
design,  this  problem  remains  NP-ccmplete. 

5 .  Aeaaure  and.  Perimeter  Problems 

These  problems  are  both  solvable  in  linear  time.  To 
solve  the  measure  problem  we  proceed  bottcm-up  through 
the  set,  ccmputing  for  each  symbol  the  sum  of  the 
measures  of  the  rectangles  it  contains  by  adding  to¬ 
gether  the  (previously  calculated)  measures  of  the  sub¬ 
rectangles  it  calls.  The  perimeter  problem  is  solved 
in  a  similar  fashion. 

6 .  Co  nneotedr.eaa  Problems 


a)  Are  the  Rectangles  in  a  Si 

The  answer  is  yes  if  and  only 
one  rectangle  in  the  set. 


Yiala  Connected.  :crv cyiev,-~ i 

if  there  is  exactly 
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b)  Humber  of  Connected  Components 

This  can  be  solved  in  linear  time  by  a  bottom-up 
algorithm  like  the  algorithms  used  to  3olve  problem  5. 
a)  Enumeration  of  All  Connected  Components 
The  construction  used  in  Subsection  2.2  to  show 
that  this  problem  can  require  exponential  time 
still  holds. 

7 .  Equality  and  Subset  Testing  of  Rectangle  Sets 

We  leave  these  as  open  problems. 
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4.  IMPLICATIONS  AND  OPEN  PROBLEMS 

The  purpose  of  this  paper  has  been  to  lay  a  solid 
theoretical  foundation  for  the  manipulation  of  hierarchi¬ 
cally  defined  sets  of  rectangles  in  the  plane.  There  are 
two  motivations  for  this  approach:  the  questions  are  inter¬ 
esting  from.  a  purely  mathematical  viewpoint,  and  the  theory 
can  occasionally  have  a  substantial  impact  on  practice. 

For  instance,  the  asymptotic  worst-case  rectangle  inter¬ 
section  algorithm  of  Bentley  an d  Wood  [1980]  motivated  the 
efficient  expected-time  algorithm  of  Bentley,  Haken  and 
Hon  [1980],  which  was  in  turn  used  in  the  VLSI  Design  Rule 
Checker  of  Haken  [1980]. 

The  primary  results  of  this  paper  are  summarized  in 
Table  1  .  The  first  column  of  that  table  surveys  previous 
work  on  geometrically  defined  rectangle  problems,  the 
third  column  shows  that  most  problems  are  NP-hard  when 
presented  with  unrestricted  HIL  inputs,  and  the  second  column 
shows  that  when  the  designs  are  constrained  to  be  consistent, 
then  most  of  the  problems  become  rather  easy  to  solve.  These 
facts  correspond  closely  to  the  experience  of  Hon  [1980] 
and  Whitney  [1980]  in  using  their  programs  that  manipulate 
hierarchical  VLSI  designs:  highly  structured  designs  (which 
are  never  consistent  but  usually  rather  close  in  seme  sense) 
can  be  processed  very  quickly,  while  highly  unstructured 
designs  require  prohibitive  amounts  of  processing  time. 


It  is  important  to  state  carefully  the  implications  of 
the  above  results  for  the  builders  of  systems  that  process 
hierarchical  designs.  The  NP-hardness  results  do  not  state 
that  such  designs  cannot  be  processed  efficiently;  rather, 
they  imply  that  it  is  highly  unlikely  that  one  can  ever  find 
an  algorithm  that  will  efficiently  process  every  design. 

Thus  one  should  not  search  for  such  an  algorithm,  but  rather 
focus  one's  energy  on  algorithms  that  work  well  for  an 
important  subclass  of  designs. 

There  are  two  types  of  subclasses  that  might  be  inves¬ 
tigated,  and  both  appear  to  offer  much  to  theoretician  and 
practitioner  alike. 

1 .  A  Statistical  Avvroach 

Using  this  approach  one  would  first  build  a  probabilis¬ 
tic  model  of  VLSI  designs,  and  then  design  an  algorithm 
that  performs  well  on  the  average  when  the  inputs  are 
drawn  from  that  distribution.  (This  is  the  approach 
taken  for  the  geanetry-only  rectangle  intersection 
problem  by  Bentley,  Haken  and  Hon  [1980]).  Devising  a 
probabilistic  model  that  includes  both  the  graph- 
theoretic  aspects  of  the  HIL  structure  and  the  geometric 
aspects  of  the  shapes  and  placement  of  the  rectangles 
is  a  subtle  mathematical  problem;  fitting  such  a  model 
to  actual  data  will  require  am  exceptionally  talented 
practitioner. 
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2 .  A  Restriction  Approach 

We  saw  that  restricting  the  designs  to  be  consistent 
allowed  many  of  the  problems  to  be  solved  quite  effi¬ 
ciently.  Unfortunately,  consistency  is  so  restrictive 
that  no  real  designs  can  be  built  using  it!  tve  therefore 
observe  a  tradeoff  between  severe  restrictions  (which 
exclude  many  designs  but  facilitate  rapid  processing) 
and  lax  restrictions  (which  exclude  few  designs  but 
admit  many  that  are  very  time  consuming  to  process). 

It  will  be  important  to  identify  families  of  restric¬ 
tions  that  exclude  only  a  few  designs  (and  hopefully 
uninteresting  ones  at  that)  but  admit  to  very  rapid 
processing  of  the  remaining  designs. 

The  NP -completeness  results  of  Subsection  3.2  have 
a  rather  interesting  implication  for  this  endeavor. 
Recall  that  the  Rectangle  Intersection  Question  is 
NP-complete  when  the  inputs  are  presented  in  HIL.  Many 
people  suspect  that  this  Implies  that  the  complement 
of  the  problem  is  not  even  in  NP  (see,  for  instance, 
Garey  and  Johnson  [1979,  Section  7]).  This  in  turn 
would  imply  that  there  can  never  be  a  polynomial-length 
certificate  of  nonintersection  for  a  set  of  rectangles. 
This  means  that  if  a  restriction  approach  is  taken  in 
which  the  designer  adds  a  polynomial  amount  of  extra 
information  and  the  resulting  design  can  be  processed 
in  polynomial  time,  then  some  valid  designs  must  ne¬ 
cessarily  have  been  excluded. 
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